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Review Article 


Lithium Prophylaxis in Recurrent Affective Illness 
Efficacy, Effectiveness and Efficiency 


R. GUSCOTT and L. TAYLOR 


The efficacy of lithium prophylaxis for recurrent mood disorders is well established. Despite 
concern that the later efficacy studies have shown poorer results, these studies (after 1980) 
are equally confirmatory of lithium’s efficacy. However, questions have been raised with regards 
to the effectiveness of lithium prophylaxis under ‘ordinary’ clinical conditions. Part of the 
confusion stems from the failure to distinguish clearly efficacy (the potential of a treatment) 
from effectiveness (the results obtained under clinical conditions). Studies of effectiveness 
or naturalistic studies show poorer results than efficacy studies in all areas of medicine. 
The major reason for this discrepancy with lithium prophylaxis is poor compliance. Estimations 
of the efficiency (cost benefits) of lithium prophylaxis are flawed by the failure to consider 
such issues. It is proposed that specialised lithium or mood disorders clinics have the potential 
to narrow the gap between efficacy and effectiveness - efficiency. 


It is well documented that recurrent mood disorders 
(including bipolar and unipolar disorder) result in 
a significant degree of morbidity, mortality and 
economic loss (Goodwin & Jamison, 1990). There 
is also suggestive evidence that affective illness is 
increasing in prevalence since 1940 (Gershon et al, 
1987; Joyce et al, 1990). The introduction of lithium 
salts for the prevention of recurrences of affective 
illness has been hailed as a major advance in modern 
psychiatry if not in all medicine (Goodwin & 
Jamison, 1990, p. 665). Some of the early studies 
could even be said to provide dramatic confirmation 
of the benefits of lithium prophylaxis (Baastrup et al, 
1970) despite some initial scepticism (Johnson, 1984). 
However, there is concern that later efficacy studies 
are showing a poorer response rate. Recent reports 
on lithium prophylaxis question whether the benefits 
of lithium expected on the basis of the early 
prospective trials are realised in ‘ordinary’ clinical 
practice (Markar & Mander, 1989; Dickson & Kendell, 
1986). The contrast between the early efficacy studies 
and the later, less optimistic reports raises serious 
questions about the current use of lithium in clinical 
practice. Indeed in a 1987 editorial in the Lancet, 
serious doubts were raised about the value of 
maintenance lithium. 

In this review we address these issues systematically. 
We first review the findings of the early and 
later efficacy studies, and address some of the 
methodological issues involved. We then address 
the issue of the effectiveness of lithium pro- 
phylaxis in ‘ordinary’ clinical practice and then 
proceed to discuss briefly its efficiency. Finally, 
we discuss the implications of our findings for 


service delivery, quality assurance and future 
research. 


Efficacy 


It is important to distinguish conceptually efficacy 
from effectiveness. Efficacy refers to the therapeutic 
potential of a treatment. Efficacy studies answer the 
question as to whether a treatment can work, and 
are usually double-blind randomised controlled trials 
(Tugwell et al, 1986). 

The most important early open trials of lithium 
prophylaxis were that of Baastrup & Schou (1967) 
and Angst et a/ (1970). Controlled trials have been 
confirmatory of lithium’s prophylactic efficacy. In 
a recent review of the ten major double-blind studies 
comparing lithium prophylaxis to placebo in bipolar 
disorder, it was concluded that the data showed a 
34% relapse rate on lithium compared with 81% 
relapse rate on placebo (Goodwin & Jamison, 1990, 
pp. 686-687). Nine out of 10 studies reported a 
statistically significant difference between lithium 
and placebo. The tenth study, however, had only 
seven patients in the lithium group (Melia, 1970). 

To examine the question of lithium efficacy ‘then 
and now’, we completed a comprehensive comparison 
of the early studies (Baastrup ef al, 1970; Coppen 
et al, 1971; Prien et al, 1973a,b; Cundall et al, 1972; 
Hullin et al, 1972; Stallone et al, 1973; Dunner et 
al, 1976; Fieve et al, 1976; Quitkin et al, 1978) to 
the later studies (Ahlfors et a/, 1981; Quitkin ef al, 
1981; Kane et al, 1982; Prien et al, 1984; Watkins 
et al, 1987) on a number of variables including: 
centre, patient demographics, previous course of 
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illness, study design, length of follow-up, outcome 
measured, monitoring of compliance, sample size 
and drop-outs, statistical analysis and conclusions 
drawn. We will summarise the more interesting 
similarities and differences. 

First, the earlier studies provided more information 
about the patients’ previous course of illness and 
comparatively they appeared to be a sicker group. 
For example, the patient population in Baastrup et 
al (1970) had a range of 2-20 illness episodes, with 
a mean of 8. Cundall’s study (1972) had a range of 
3-21, with a mean of 10. The newer studies tended 
to set a minimum requirement for entry into the 
study. Quitkin et al (1981) documented the previous 
course as 1-4 episodes; Prien et a/ (1984) a median 
of 7 for bipolars, 4 for unipolars; and Watkins et 
al (1987) did not document previous course at all. 
The earlier studies had more frequent follow-up of 
compliance. That is, in 5 of 11 studies, lithium levels 
were drawn every 4 weeks for extensive periods of 
time (12-40 months). This was done in only | of 5 
of the later studies for up to 24 months. 

The drop-out rates were comparable with a mean 
drop-out rate of 38% in the earlier studies and 41% 
in later studies. The drop-outs were examined more 
closely. In the early studies, a large percentage of 
the patients on placebo dropped out because of 
illness. For example, Fieve et al (1976) had 17 
patients receiving lithium, with two drop-outs 
unrelated to illness, whereas 18 patients received 
placebo, all of whom dropped out, 17 because of 
illness. Prien ef al (1973) had 41 patients on placebo drop 
out because of mood disturbance compared with 11 
on lithium. Stallone et a/ (1973) had 31 of 52 patients 
drop out: 22 were receiving placebo, of whom 18 had 
a mood disturbance; 9 were receiving lithium; and 
3 of those suffered a mood disturbance. 

In one later study, 39 out of 75 patients dropped out 
(Quitkin ef a/, 1981): 35 were euthymic - 19 on lithium 
alone and 16 on a combination of lithium and imi- 
pramine. The remaining 4 left for non-affective 
reasons. Kane et a/ (1982) stated they had more drop- 
outs on lithium because they were well. In the studies 
by Prien et a/ (1984) and Watkins (1987), it was not 
clear how many drop-outs there actually were. 

Therefore, the later studies made lithium efficacy 
appear worse. They had a larger number of drop- 
outs on lithium but it was postulated they were 
euthymic at the time. 

The studies also tended to measure different 
things. In 8 of 11 of the earlier studies, lithium was 
compared with placebo and one was a mirror-image 
study (pre- and post-lithium). The consensus over- 
whelmingly supported the efficacy of lithium. The two 
remaining studies compared lithium with placebo, 
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imipramine and a combination of lithium and imi- 
pramine, again noting the efficacy of lithium. Of the 
five later studies, lithium was compared with fluanxol 
in One, carbamazepine in another and the remaining 
three studies compared combination therapy; that 
is, lithium versus imipramine versus placebo versus 
lithium plus imipramine. Again, the efficacy of lithium 
was upheld. Studies of similar design (early and later) 
had very similar conclusions. In one study which 
showed a low response rate to lithium, the researchers 
hypothesised that poor-prognosis patients may have 
been selected (Ahlfors ef a/, 1981). 

In summary, the evidence is that both the early 
and later studies together provide very strong support 
for the efficacy of lithium, despite methodological 
differences. 


Effectiveness 


Effectiveness must be clearly distinguished from 
efficacy, a conceptual differentiation which is not 
often clear in the lithium literature. Effectiveness 
refers to actual results obtained in ‘ordinary’ clinical 
practice. Effectiveness depends upon five variables — 
efficacy, diagnostic accuracy in the field, health 
professionals’ compliance, patients’ compliance, and 
coverage (Tugwell ef ai, 1986, p.47). We have 
already discussed the issue of efficacy. 


Diagnostic accuracy 


For a treatment to be effective in ordinary clinical 
practice, clinicians must correctly identify those who 
might benefit from the treatment. Two types of error 
are possible, lessening the effectiveness of a treatment: 


(a) Patients who would benefit from treatment are 
not diagnosed. 

(b) The indications for the treatment are 
broadened, resulting in patients who are less 
likely to benefit being given the treatment. 


There is some evidence of a diagnostic shift in North 
America away from the diagnosis of schizophrenia 
to affective illness. Grof (1987) considered this to be 
one of the factors explaining the more pessimistic 
results from the effectiveness studies. However, there 
is also evidence that many patients who could benefit 
are not correctly identified or offered treatment 
(Schou & Weeke, 1988; Schou, 1988). 


Patients’ compliance 


Patient compliance in psychiatry is a much under- 
studied phenomenon (Guscott & Grof, 1991). 
Unfortunately, as pointed out by Goodwin & Jamison 
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(1990, p. 752), of the more than 10 000 articles written 
about lithium, less than 50 deal with compliance in a 
substantial way. In their review of the literature, rates 
of non-compliance varied from 18 to 53%. They 
conclude: 


(a) non-compliance has severe consequences — 
relapse, with all the associated illness sequelae 

(b) to be effective, the drug must be taken con- 
tinuously on a long-term basis 

(c) non-compliance is possibly the most common 
critical problem with lithium treatment 

(d) poor compliance is the major factor accounting 
for treatment failure, hardly a surprising 
finding (Goodwin & Jamison, 1990, p. 746). 


Health professionals’ compliance 


Health professionals’ compliance with the appropriate 
initiation and supervision of lithium prophylaxis is a 
virtually unstudied phenomenon. Despite both an 
extensive MED-LINE search and a literature search 
through the Lithium Information Center in 
Wisconsin, the authors were unable to find a single 
paper dealing with this topic systematically. There is 
evidence from other areas of medicine that infrequent 
use of a technology is associated with poorer 
therapeutic outcomes (Tugwell et a/, 1986, p. 47). 
Whether or not this applies to the use of lithium 
is not clear. 


Ambiguity of professional versus patient 
compliance 


Although the difference between health professionals’ 
compliance and patients’ compliance (extent to which 
patients adhere to therapeutic regimens) may seem 
intuitively obvious, there are clear areas of ambiguity. 
For example, from the patients’ point of view, the 
most important reasons for non-compliance were 
dislike of drugs controlling their moods, disliking the 
idea of chronic illness symbolised by need for drug 
therapy, depression and side-effects (Jamison ef al, 
1979). If the clinician fails to comply with accepted 
proven strategies to address these patient reasons for 
non-compliance (i.e. education of patient and family, 
treatment of breakthrough depressions, carefully 
monitoring and treatment of side-effects), and the 
patient then is defined as ‘non-compliant’, is this then 
patient or provider non-compliance? 

Clearly, the relationship between patient and 
provider compliance is multifaceted, ambiguous 
and in short, little studied. 
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Coverage 


Coverage refers to the extent to which a treatment 
is appropriately applied to a patient group who could 
benefit from the treatment. Appropriate utilisation 
of treatment depends upon accessibility, awareness of 
the availability of treatment and acceptability 
(Tugwell et al, 1986, p. 48). Although there are 
utilisation data on lithium usage in the community 
(McCreadie & Morrison, 1985; Schou, 1989), such 
data as exist, as Mogens Shou points out ‘‘. . . do 
not tell us if lithium was used too much or too little’ 
(Schou, 1989, p. 576). There is some evidence from 
epidemiological surveys that bipolar disorder is 
undertreated. For example, data from the ECA 
studies indicate that bipolar disorder is the most 
undertreated of all major psychiatric disorders 
(Shapiro et al, 1984). Goodwin & Jamison (1990, 
p. 3) estimate that only one-third of persons with 
bipolar disorder are currently receiving treatment in 
the USA. 

Access to appropriate psychiatric services is clearly 
a major variable determining utilisation. Both patient 
and health-care provider awareness of the availability 
and efficacy of lithium prophylaxis is a second major 
variable. Finally, the third variable, acceptability of 
treatment, is clearly relevant to lithium treatment. 
Indeed, the majority of the public do not recognise 
mood disorders as true medical illnesses requiring 
biological interventions. 


Evaluating the community effectiveness of 
lithium prophylaxis 


To assess the effectiveness of lithium prophylaxis 
globally in the community, all five variables discussed 
above would need to be estimated. There is, however, 
no research which addresses the issue of lithium 
prophylaxis from such a global perspective of 
effectiveness. As pointed out in other contexts, such 
studies, if they were available, would likely show 
both international and regional variation (Tugwell 
et al, 1986, p. 49). 

There are a number of ‘naturalistic’ studies, both 
epidemiological and management trials, which raise 
serious concerns about the benefits of lithium in 
ordinary clinical practice (Dickson & Kendell, 1986; 
Page et al, 1987; Maj et al, 1989; Markar & Mander, 
1989; Harrow et al, 1990; Tohen et al, 1990a,b; 
Aagaard & Vestergaard, 1990; O’Connell et al, 1991). 

Dickson & Kendell (1986) studied hospital discharge 
rates for mania, depression and schizophrenia in 
Edinburgh retrospectively over a 12-year period. They 
found the rates for mania rose dramatically, 
depression less so, and schizophrenia discharges 


744 


actually declined. After careful consideration of 
factors which might account for these changes despite 
the 10-fold rise in lithium usage, the authors raised 
the concern of lithium’s benefits in ordinary clinical 
practice. This study led to an editorial in the Lancet 
(1987, p. 424) raising doubts about the value of 
maintenance lithium. There was considerable criticism 
of this study (Grof, 1987) and the pessimistic Lancet 
editorial (Coppen, 1987). 

A second study by Marker & Mander (1989), 
wrongly described by the authors as an efficacy study 
(it is in fact an effectiveness study), raised further 
concerns about lithium benefits in ‘ordinary’ clinical 
practice. Using a naturalistic design, their conclusions 
were that lithium prophylaxis ‘‘does not confer the 
statistically significant benefit that would be anticipated 
from prospective clinical trials’? (Markar & Mander, 
1989, p. 499). Other studies, of naturalistic follow- 
up design, also showed a poor to modest benefit for 
long-term lithium prophylaxis (Page et al, 1987; Maj 
et al, 1989; Aagaard & Vestergaard, 1990; Harrow ef 
al, 1990; Tohen et al, 1990b; O’Connell et a/, 1991). 
The poorer outcome of these naturalistic studies 
compared with the early efficacy studies is to be 
expected, however. As Sackett & Gent (1979) point 
out, when treatment is controlled (as in the efficacy 
studies), the study population is likely biased in their 
ability to comply with treatment. Naturalistic studies 
more accurately reflect what is going on ‘in the field’. 

These considerations must be paramount in 
examining the effectiveness of lithium in ordinary 
clinical practice. Other factors have been identified 
as important: 


(a) diagnostic shift 

(b) tertiary-care referral bias 

(c) intermittent compliance (causing rebound 
episodes, increasing episode frequency or 
causing lithium resistance) 

(d) a change in the nature of the disorder 

(e) the overuse of antidepressants. 


However, we would submit that compliance, patient 
and provider, are the most likely explanations. In 
short, if patients do not take treatment with good 
medical supervision, they will not stay well. 


Efficiency and economy 


Our review of the differences between efficacy and 
effectiveness suggests that the alleged economic 
benefits of lithium prophylaxis may be vastly over- 
estimated. Reifman & Wyatt (1980), in an optimistically 
entitled paper ‘‘Lithium: A brake in the rising cost of 
mental illness’’, estimated medical cost savings between 
1969 and 1979 of $2.88 billion, and $1.28 billion gains 
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in production-a $4 billion dollar saving. It is 
difficult, when looking at their assumptions, to 
believe such figures, although the economic potential 
of lithium treatment we would agree is enormous. 
Firstly, they looked at treatment for 6 months only; 
secondly, only in patients who entered the hospital 
system; and thirdly, they assumed a 60% success rate 
(a figure they thought conservative). However, our 
own review of lithium effectiveness in the community 
leads us to conclude that only one-third of patients 
will remain compliant. Furthermore, from a public 
health perspective, it must be emphasised that if 
Goodwin & Jamison are correct, only one-third of 
bipolar patients are currently receiving treatment. 
ECA data support this contention, showing it to be 
the most undertreated of all psychiatric disorders. 
The difference between potential and actual economic 
benefits (however defined) is likely to be considerable. 
A study from Scotland by McCreadie (1987) was 
similarly optimistic, but the short-term savings 
reported must be considered in the context of the 
abysmal record of lithium compliance after one-year 
surveillance. 


Discussion 


We think it is worth emphasising the two major 
points of our review: 


(a) The gap between the efficacy and effectiveness 
of lithium prophylaxis indicates that there are 
serious deficiencies in current modes of lithium 
surveillance. (The ‘ordinary conditions of 
practice’.) 

(b) Efforts directed at lessening that gap should 
lead to improvements in the efficiency and 
economy of lithium treatment. 


It must be pointed out that the gap between 
efficacy and effectiveness is a universal phenomenon 
in medicine. Such gaps between efficacy and effective- 
ness cannot be accepted with equanimity, however. 
Indeed, if we are witnessing what has been referred 
to in the New England Journal of Medicine as the 
‘Third revolution in medical care — assessment and 
accountability’’, such gaps will not be accepted 
without concern by either governments or third-party 
insurers (Relman, 1988). 

The poorer outcome in the lithium effectiveness 
studies raises serious questions about service delivery 
and quality assurance. That is, what is the optimal 
clinical setting for the supervision of lithium prophy- 
laxis? We think that lithium prophylaxis cannot be 
evaluated independently of the context in which it 
is administered. It is not the lithium alone but the 
mode of service delivery which confers the benefits 
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of lithium in recurrent affective illness. It should be 
pointed out that in both Dickson & Kendell’s study 
(1985) and Markar & Mander’s study (1989), lithium 
treatment was supervised by family physicians with 
phone reminders for lithium levels. One should not 
be surprised if patient and provider compliance is 
low. Lithium prophylaxis is a treatment for the long- 
term, requiring patient and health-care provider 
commitment, psychoeducation for patient and family, 
specific psychotherapy for the illness, ensuring 
compliance, and the prompt treatment of side-effects. 

What might prove very useful is research correlating 
long-term outcome with mode of lithium surveillance, 
1.e. specialised clinic versus family doctor/internist 
follow-up versus general psychiatric clinic. However, 
we think the setting which most approximates the 
conditions of the early efficacy studies is the 
specialised lithium or mood disorders clinic. 

The feasibility of specialised mood disorders 
services or the evaluation of alternative service 
delivery modes should be a high priority for health- 
care providers. 
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